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Abstract 


Parallel machine scheduling has been extensively studied in the past decades, with appli¬ 
cations ranging from production planning to job processing in large computing clusters. 
In this work we study some of these fundamental optimization problems, as well as their 
parameterized and reoptimization variants. 

We first present improved bounds for job scheduling on unrelated parallel machines, with 
the objective of minimizing the latest completion time (or, makespan) of the schedule. We 
consider the subclass of fully-feasible instances, in which the processing time of each job, on 
any machine, does not exceed the minimum makespan. The problem is known to be hard 
to approximate within factor 4/3 already in this subclass. Although fully-feasible instances 
are hard to identify, we give a polynomial time algorithm that yields for such instances a 
schedule whose makespan is better than twice the optimal, the best known ratio for general 
instances. Moreover, we show that our result is robust under small violations of feasibility 
constraints. 

We further study the power of parameterization. In a parameterized optimization prob¬ 
lem, each input comes with a fixed parameter. Some problems can be solved by algorithms 
(or approximation algorithms) that are exponential only in the size of the parameter, while 
polynomial in the input size. The problem is then called fixed parameter traetable (FPT), 
since it can be solved efficiently (by an FPT algorithm or approximation algorithm) for 
constant parameter values. We show that makespan minimization on unrelated machines 
admits a parameterized approximation seheme, where the parameter used is the number of 
processing times that are large relative to the latest completion time of the schedule. We 
also present an FPT algorithm for the graph-balancing problem, which corresponds to the 
instances of the restrieted assignment problem where each job can be processed on at most 
2 machines. 

Finally, motivated by practical scenarios, we initiate the study of reoptimization in 
job scheduling on identical and uniform machines, with the objective of minimizing the 
makespan. We develop reapproximation algorithms that yield in both models the best pos¬ 
sible approximation ratio of (1 -|-e), for any e > 0, with respect to the minimum makespan. 
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Chapter 1 
Introduction 


1.1 Scheduling on Parallel Machines 

Consider the following fnndamental problem in scheduling theory. We are given a set J 
of n independent jobs that must be scheduled without preemption on a collection At of m 
parallel machines. If job j is scheduled on machine i, the processing time required is Pij, 
which is a positive integer, for every i G At and j G JT. The total time used by machine 
i G At, or the load on machine i, is the sum of the processing times for the jobs assigned 
to i, and the makespan of an assignment is the maximum load over all the machines. The 
objective is then to hnd a schedule, which assigns each job to exactly one machine, such 
that the makespan is minimized. 

1.1.1 Scheduling Models 

The wide literature on scheduling often distinguishes between the following scheduling 
models. 

Identical Machines. Job processing times are identical across the machines, i.e., Pij = pj 
for all j E J and f G AJ. 

Uniform Machines. Each machine i has a speed Sj. The length of job j on machine i 
is some uniform processing time pj scaled by the speed Sj, i.e., Pij = ^ for all j E J and 
ie M. 

Unrelated Machines: Each job j may have an arbitrary processing time Pij > 0 on 
machine i, for j ^ J and i G A4. 

While makespan minimization is known to be NP-hard in all models (even for m = 2) 
[LK79] . the hrst two models are considered somewhat easier, since the problem can be ap¬ 
proximated efficiently in both up to some e factor, for any e > 0 (see Section ri.l.2p . In con¬ 
trast, in the unrelated machines model, the problem becomes hard to approximate within 
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a factor better than |. Moreover, since 1990, when the state of the art 2-approxiniation 
algorithm was presented by Lenstra, Shmoys and Tardos |LST90j , there was no signihcant 
improvement on either the upper or lower bound, although the problem was consistently 
investigated. This led researchers to consider special cases and improving the bound of 
2, either by a constant factor, or by some function of the input parameters (see review in 
Section ll.l.2p . 

In this work, we consider the subclass of fully-feasible instances. We say that an instance 
is fully-feasible if job processing times, pij, do not exceed the length of the optimal schedule 
for the instance, for every job j G JT and machines i E Ai. Observe that an optimal 
schedule never assigns a job to machine on which its length is greater than the makespan of 
an optimal schedule; thus, from an optimal scheduler’s viewpoint, if Pij exceeds the optimal 
makespan then is considered to be oo. 

We also consider instances that are almost fully-feasible, that is, for any job j, the number 
of machines on which job j is not feasible (i.e., has processing time larger than the length 
of the optimal makespan) is relatively small. 

When considering real-life applications, the general model of unrelated machines, which 
makes no assumptions on job processing times, seems too broad. Indeed, such applications 
usually deal with fully-feasible (or almost fully-feasible) workloads, as they commonly han¬ 
dle relatively large sets of jobs. 

Let Topt and Lopt denote the optimal makespan and the minimal average machine load 
over optimal assignments, respectively. For heterogeneous workloads of a huge number of 
jobs, in which the makespan is counted in months or even years, the processing time of a 
given job is negligible compared to the makespan; for such workloads, we have Pij <C Topt- 
In this case, an algorithm of |ST93j . yields a schedule of makespan at most Topt + Pmax, 
where Pmax = maXijPij, is more suitable. However, for smaller sets of jobs, ptj can be large 
relative to Topt, such that Lopt < Pmax, and for these instances our algorithm is the state of 
the art. Relevant applications for such workloads are e.g., job packing in warehouse-scale 
|VKW14j . large-scale clustering IVP-1-151 and applications in parallel design patterns such 
as Fork-Join and MapReduce (see, e.g., |DMN121 |LL-f 12| ). 


1.1.2 Related Work 

Identical and Uniform Machines The problem of makespan minimization on identical 
or uniform machines is known to be NP-hard [GJ79] . A polynomial-time approximation 
scheme (PTAS) is a family of algorithms {A^ : e > 0}, where A^ is a (1 J- e)-approximation 
algorithm that runs in time polynomial in the input size but is allowed to be exponen¬ 
tial in An efficient polynomial-time approximation scheme (EPTAS) is a PTAS with 
running time f{-)poly{\X\), where \X\ is the input size, (for some function /), while a 
fully polynomial-time approximation scheme (FPTAS) runs in time poly{^, |X|). Since the 
scheduling problem is NP-hard in the strong sense already on identical machines (as it con¬ 
tains bin packing and 3-Partition as special cases) [GJ79] . we cannot hope for an FPTAS. 
For identical machines, Hochbaum [H96] and Alon et ah | AA-1-98| gave an EPTAS with 
running time /(^) J- 0{n), where / is doubly exponential in i, and for uniform machines. 
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Jansen |J10j gave an EPTAS with rnnning time +poly{\X\). 


Unrelated Machines A classic result in scheduling theory is the Lenstra-Shmoys-Tardos 
2-approximation algorithm for makespan minimization |LST90] . They also proved that 
the problem is NP-hard to approximate within a factor better than |. Gairing et ah 
[GMW07] presented a more efficient, combinatorial 2-approximation algorithm based on 
flow techniques. Shchepin and Vakhania |SV05j showed that the rounding technique used 
in |LST90] can be modified to derive an improved ratio of 2 — Shmoys and Tardos 
[ST93] showed an approximation algorithm that yields a schedule of makespan at most the 
length of an optimal schedule plus the largest processing time of any job in the instance. 

Although makespan minimization on unrelated machines is a major open problem in 
scheduling theory, and is extensively studied, there was no significant progress on either 
the upper or lower bound for over two decades, since the publication of [LST90]. This 
led researchers to consider interesting special cases and improving the upper bound for 
them. A well known special case is the restricted assignment problem, where jobs have 
processing times Pij G {pj,oo}. Svensson |S12] gave a polynomial-time algorithm that 
approximates the optimal makespan of the restricted assignment problem within a factor 
of If + f ~ 1-94 -|- e for e > 0 , and also presented a local search algorithm that will 
eventually find a schedule of the mentioned approximation guarantee, but is not known to 
converge in polynomial-time. Gairing et al. |GL-h04| presented a combinatorial (2 — ^ 7 ^)- 
approximation algorithm for the restricted assignment problem. 

Ebenlendr et al. considered in |EKS08] the graph balancing problem, a special case of 
the restricted assignment problem where each job j has a finite processing time, pij < 00 , 
on at most two machines. The paper gives an elaborate 1.75-approximation algorithm for 
the problem. The authors also show that the problem is hard to approximate within a 
factor less than | even on bounded degree graphs, i.e., when the maximum degree is some 
constant. In the unrelated graph balancing problem, introduced by Versache and Weiss 
|VW14j . each job can be assigned to at most two machines, but processing times are not 
restricted. They showed that this subclass of instances constitutes the core difficulty for the 
linear programming formulation of makespan minimization on unrelated machines, often 
used as a first step in obtaining approximate solutions. Specifically, they showed that the 
strongest known LP-formulation, namely, the configuration-LP, has an integrality gap of 2. 

Vakhania et al. |VMH14j considered makespan minimization on unrelated machines for 
the subclass of instances where job lengths can take only two values, p and q, which are 
hxed positive integers, such that p < q. They presented a polynomial-time algorithm that 
uses linear programming with absolute approximation factor of q (i.e., all schedules have 
makespan at most OPT + q). Page |P14] considered restricted assignment instances with 
processing times in a fixed interval, \p,q], and gave a ^-approximation algorithm, and a 
|-approximation algorithm for the case where pij G {1, 2, 3}. Ghakrabarty et al. IGK-H5I 
considered instances with two types of jobs: long and short, namely, pij G {l,e} for some 
e > 0. They obtained a (2 — J)-approximation algorithm for such instances. 

Shmoys and Tardos |ST93j considered the generalized assignment problem (GAP), where 
each job j incurs a cost of > 0 when assigned on machine i, and the objective is to 
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minimize the makespan and the total cost. The paper |ST93j presents a polynomial-time 
algorithm that hnds a schedule of makespan at most twice the optimum with optimal cost. 
A summary of the known results for unrelated machines is given in Table 11.11 


Result 

Authors 

Restrictions on the Unrelated Model 

2 -approximation 

|T.ST9n] 


Hard for factor < | 

|TRT9n] 


A bound of Topt Pmax 

|ST93] 


(2 — ^)-approximation 

|svn5] 


Integrality gap <1.95 

|S12 

Pij e {pj, oo} 

1.75-approximation 

[FKSnS] 

Pij G {pj, oo}, pij < cx) on < 2 machines 

(2 — (5)-approximation, 5 > 0 

ICK+151 

Pij G {1, e} for some e > 0 

A bound of Topt + q 

[VMH14] 

Pij G {p, q} for some p < q 

- & ^-approximation 

|P14| 

Pij G [p,g] k Pij G {1,2,3} 

A bound of Topt + 

This Work 

Pij — Tqpi 

A bound of Topt + ^ 

This Work 

<p > ^, for minimal feasible values of T & L 

A bound of Pmax + ^ 

This Work 

Pij G {pj, oo} 


Table 1.1: Known results for makespan minimization on unrelated machines. 


1.2 Fixed Parameter Algorithms 

Parameterized complexity is a branch of computational complexity theory that focuses on 
classifying computational problems according to their inherent difficulty with respect to 
multiple parameters of the input. The complexity of a problem is then measured as a 
function in those parameters. This allows the classihcation of NP-hard problems on a hner 
scale than in the classical setting, where the complexity of a problem is only measured by 
the number of bits in the input (see, e.g., |DF12] ). 

Under the assumption that P ^ NP, there exist many natural problems that require 
super-polynomial running time when complexity is measured in terms of the input size only, 
but that are computable in a time that is polynomial in the input size and exponential (or 
worse) in a parameter k. Hence, if k is hxed at a small value, and the growth of the function 
over k is relatively small, then such problems can still be considered “tractable” despite 
their traditional classihcation as “intractable”. 

Some problems can be solved exactly, or approximately, by algorithms that are exponen¬ 
tial only in the size of a hxed parameter while polynomial in the size of the input. Such 
an (approximation) algorithm is called a fixed-parameter tractable (FPT) (approximation) 
algorithm, because the problem can be (approximately) solved efficiently for small values 
of the hxed parameter. 

Problems in which some parameter k is hxed are called parameterized problems. A 
parameterized problem that allows for such an FPT algorithm is said to be a fixed-parameter 
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tractable problem and belongs to the class FPT. 


1.2.1 Parametrized Scheduling Problems 

Despite the fnndamental natnre of schednling problems, and the clear advantages of fixed- 
parameter algorithms, no snch algorithms are known for many of the classical schednling 
problems. One obstacle towards obtaining positive resnlts appears to be that, in contrast to 
most problems known to be hxed-parameter tractable, schednling problems involve many 
nnmerical inpnt data (e.g., job processing times, release dates, job costs), which alone 
canses many problems to be NP-hard, thns rnling ont hxed-parameter algorithms. 

1.2.2 Related Work 

While minimizing the makespan on identical and uni form machines admits an EPTAS, 
see e.g., |AA-k981 IJlOj . the rnnning times of these approximation schemes usnally have a 
bad dependence on e. In addition, these problems are strongly NP-hard |GJ79j . there¬ 
fore we cannot hope to obtain an FPTAS. These considerations call for hnding which 
schednling problems are hxed-parameter tractable (FPT). This amonnts to identifying 
instance-dependent parameters k that allow for algorithms that hnd optimal solntions in 
time f{k) ■ poly{\X\), for instances X and some fnnction / depending only on k. 

As for schednling on nnrelated machines, an FPTAS is known |HS76] . bnt only when 
assnming a hxed nnmber of machines. Note that, if the nnmber of machines or the nnmber 
of processing times are constant, the problem is still NP-hard |LST90j . and thns no FPT- 
algorithms can exist for these choices of parameters. This motivates ns to identify instance- 
dependent parameters A;, while assnming an arbitrary nnmber of machines, that allow for 
better FPT approximation algorithms. 

Marx |M11] proposed the research direction of schednling with rejection, where the pa¬ 
rameter k is given with the inpnt for the schednling problem, and the solntion has to 
schednle all bnt k jobs. This direction was explored recently by Mnich and Wiese |MW13] . 
who presented for the hrst time a hxed-parameter algorithms for classical schednling prob¬ 
lems snch as makespan minimization, schednling with job-dependent cost fnnctions and 
schednling with rejection. For the problem of makespan minimization on identical ma¬ 
chines, the paper |MW13j presents an FPT algorithm, where the parameter k dehnes an 
npper bonnd on the nnmber of distinct processing times appearing in an instance. For the 
more general model of nnrelated machines, the paper gives an FPT algorithm, nsing the 
nnmber of machines and the nnmber of distinct processing times as parameters. 
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Table 11.21 summarizes the known results. 


Result 

Parameters 

Authors 

Model 

FPT algorithm 

maXjPj 

|MW13] 

identical machines 

FPT algorithm 

m, ^distinct Pij 

|MW13j 

unrelated machines 

FPT approximation scheme 

\{{i,j) • Pij > ^T}\, feasible T 

This Work 

unrelated machines 

FPT algorithm 

treewidth, degree 

This Work 

graph balancing 


Table 1.2: Known FPT algorithms for scheduling. 


We are not aware of any other work in this area. 


1.3 Combinatorial Reoptimization 

Reoptimization problems naturally arise in many real-life scenarios. Indeed, planned or 
unanticipated changes occur over time in almost any system. It is then required to respond 
to these changes quickly and efficiently. Ideally, the response should maintain high perfor¬ 
mance while affecting only a small portion of the system. Thus, throughout the continuous 
operation of such a system, it is required to compute solutions for new problem instances, 
derived from previous instances. Since the transition from one solution to another incurs 
some cost, a natural goal is to have the solution for the new instance close to the original 
one (under certain distance measure). 

We use the reoptimization model developed by Shachnai et ah |STT12] . In this model, 
we say that A is an (r, p)-reapproximation algorithm if it achieves a p-approximation for 
the optimization problem, while incurring a transition cost that is at most r times the 
minimum cost required for solving the problem optimally. 

1.3.1 Reoptimization in Scheduling 

We consider instances of scheduling problems that can change dynamically over time. Our 
goal is to compute assignments within some guaranteed approximation for the new problem 
instances, derived from the previous instances. Since the transition from one assignment 
to another incurs some cost (for example, the cost of pausing the execution of a job on one 
machine and resuming its execution on another), an additional goal is to have the solution 
for the new instance close to the original one (under a certain distance measure). 

1.3.2 Related Work 

Reoptimization Shachnai et ah |STT12] presented reapproximation algorithms for sev¬ 
eral non-trivial classes of optimization problems. This includes a fully polynomial time reap¬ 
proximation schemes (FPTRS) for DP-benevolent problems, reapproximation algorithms 
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for metric Facility Location problems, and (1, l)-reoptimization algorithm for polynomially 
solvable subset-selection problems. 

Junosza-Szaniawski et al. |JLR15] considered a variant of a recoloring problem, called 
the r-Color-Fixing. They investigated the problem of hnding a proper r-coloring of a graph, 
which is “most similar” to some given initial solution, i.e. the number of vertices that have 
to be recolored is minimum possible. They provide a (1, l)-reoptimization algorithm for 
the problem. More work on reoptimization can be found e.g., in |J151 |AE-H4| . 


Reoptimization in Scheduling Baram and Tamir |BT14] considered the problem of 
scheduling on identical machines with the objective of minimizing the total flow time, 
i.e., minimizing where Cj is the completion time of job j. They presented an 

algorithm that yields an optimal solution using the minimal possible transition cost, and 
an algorithm that outputs the best possible schedule, using a given limited budget for the 
transition, for several classes of instances. 

Bender et al. IBF-H3I focused on a scheduling problem where each job is unit-sized and 
has a time window in which it can be executed. Jobs are dynamically added and removed 
from the system. They presented an algorithm that reschedules only 0{min{log*n, log*A}) 
jobs for each job that is inserted or deleted from the system, where n is the number of active 
jobs, and A is the size of the largest window. 


1.4 Main Results 

Our hrst contribution is in the study of makespan minimization on unrelated machines, 
which leads to approximation algorithms with performance guarantees better that 2, the 
best known bound for general instances. In particular, for the subclass of fully-feasible 
instances, we present (in Chapter 2) an LP-based algorithm that achieves makespan at 
most Topt + Lopt, where T^pt and Lgpt are the minimal makespan and minimal average 
machine load for this makespan, respectively. This result is better than twice the minimal 
makespan for instances naturally arising in real-life applications. It also improves the 
minimal makespan plus the maximum processing time of a job, for instances where the 
average machine load is smaller than the maximum processing time of any job. We show 
that our algorithm is robust in the sense that it achieves an improved makespan also for 
instances that are almost fully-feasible. In such instances, each job may exceed the minimal 
makespan on a small number of machines. Formally, we dehne the feasibility parameter of 
a general instance X, denoted as the minimal fraction of machines on which a job has 
a processing time at most Topt, i.e., (p(X) = ^ present an algorithm 

that yields, for instances with large enough feasibility parameter, a schedule of makespan 
at most Topt + For instances of the restricted assignment problem, i.e., for instances 

with processing times pij G {pj, cx)}, we show that a bound of Pmax + is obtained by 
an efficient and simple combinatorial algorithm, where Pmax is the largest processing time 
of any job in the instance. 

We further study the power of parameterization and present an FPT approximation 














scheme, i.e., a (1 + e)-FPT approximation algorithm, for makespan minimization on un¬ 
related machines parametrized by the number of machine-job pairs, {i,j) E Ai x J', such 
that Pij > eT, for some makespan candidate T. We also show that the graph-balancing 
problem, parameterized by treewidth and the maximum degree of the graph, is in FPT. 
These results are presented in Chapter 3. 

Our third contribution is reapproximation algorithms for the reoptimization variants of 
makespan minimization on identical and uniform machines, which are studied here for the 
first time. Specifically, we develop (1, l-|-e)-reapproximation algorithms, namely, algorithms 
that achieve a ratio of (1 + e) to the minimum makespan, and the minimum transition cost, 
in both the identical machines and the uniform machines models, where transition costs 
can take values in {0,1}. For the uniform case, we assume that the ratio between the 
highest and the lowest machine speeds is bounded by some constant. Thus, our algorithms 
achieve the best possible ratio with respect to the makespan objective in these models. For 
the unrelated machines model, we note that an algorithm of Shmoys and Tardos [ST93] 
can be used to obtain a (1, 2)-reapproximation, thus matching the best known bound for 
makespan minimization also in this model. We summarize the results for reoptimization in 
scheduling in Table 11.31 The results are given in Chapter 4. 


Result 

costs 

Authors 

Model 

(1, 2)-reapproximation algorithm 

arbitrary 

|ST93j 

unrelated machines 

(1,1 -|- e)-reapproximation algorithm 

{0,1} 

This Work 

identical machines 

(1,1 -|- e)-reapproximation algorithm 

{0,1} 

This Work 

uniform machines with — < b 

Sm 


Table 1.3: Our contribution for reoptimization in scheduling. 
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Chapter 2 

Makespan Minimization for 
Fully-Feasible Instances 


2.1 Preliminaries 


Consider a scheduling instance X = consisting of a set of m machines M and a 

set of n jobs JT” with non-negative integers pij denoting the processing time of job j & J 
on machine i G M. An assignment of the jobs to the machines is a bijection a : JT —)■ At 
where a{j) = i ii and only if job j is assigned to machine i. For any assignment a, the load 
on machine i under assignment a, denoted as load„{i), is the sum of processing times for 
the jobs that were assigned to machine i. Thus, load„{i) = Ylij£j-a{j)=iPi 3 - The makespan 
of an assignment cr, denoted by T{a), is the maximum load over all the machines. Thus, 
T((t) = maXi(zMloada{i). The average machine load of an assignment a, denoted by L{(j), 
is given by L{a) = _ Given an instance X = (jT”, At), we denote by Topt the 

optimal makespan, i.e., Topt = 'mino:M^jT{o'), and we denote by Lopt the minimum average 
machine load for an optimal assignment, i.e., Lopt = -.m^j,T{ a*)=Toj,tL{(^*)■ 

Given an instance X = (jT”, At), we say that a job j is feasible on machine i, if and only if 
Pij < Topt- The feasibility parameter of X, denoted v^(X), is the minimal fraction of feasible 

1 . r • • u • i'T\ ■ is feasible on i}| rpu • u 

machines tor any given job, i.e., ^[-L) = miUj^j— - - - — -thus, every job 

j G JT” is feasible on at least ^(T) ■ m machines. In this terms, an instance X is fully-feasible 

if and only if (p{T) = 1. We often omit X in the notation if it is clear from the context, and 

refer to the feasibility parameter as (p. 

Given an instance X = (JT”, At), an assignment a \ J ^ At, two positive integers X and 
T and some real number 7 > 1, we denote by Bad{a, 7 ) X At the subset of machines i with 
loadoii) > T -|- 7 • X, by Good{a,'^) C At the subset of machines i with loadoii) < 7 ■ X, 
and for all j G J, by Goodj{a,'-^) C Good^a,^) the set of machines from Good{(j,'y) that 
are also legal for job j. For every i E Ai we denote by flnaxi^) = argmax {ptj : a(j) = i} 
the job with the longest processing time, assigned, by a, on machine i. 
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2.2 Approximation Algorithm for Fully-Feasible In¬ 
stances 

Given positive integers L and T, let Xij be an indicator to the assignment of job j on 
machine i. Consider the following linear program. 


m n 

LP{T,L): 


m 

1 , 

for j = 1,..., n 

i=l 




hi 

VI 

for i = 1,..., m 

J=1 



o' 

Al 


for i = 1, ...m , j = 1,..., n 


One can see that integer solutions to the above LP are in one to one correspondence with 
assignments a : Ai ^ oi average machine load at most L and makespan at most T. 

Theorem 2.1. If LP{T, L) is feasible for some L < T, then there is a polynomial-time 
algorithm that yields a schedule a with 

1. L{a) < L 


2. loadaif) < T + max{pij : a{i) = i}, Wi E A4 


Proof. Let x = {xij : i G Ai, | G JT”) be a fractional solution to LP(T,L). We round x to 
an integer solution using the following classic rounding technique, also used in |ST93] . Let 


ki = 


E 


j&J 


Each machine is partitioned into ki sub-machines s = 1,fcj. 


An edge weighted, bipartite graph B = (W, V; E) is constructed, where W = {wj : j = 1,..., n} 
representing the jobs and V = {ujs : i = 1,m, s = 1,..., k^} representing the sub-machines. 

The edges are constructed in the following way. 

Consider the nodes Vis as bins of unit capacity and the nodes Wj, as pieces of size Xij. For 
every machine i E Ai, consider the nodes in non-increasing order of the processing time of 
the corresponding job, on machine i. For convenience, assume pn > pi 2 > ... > Pin- An 
edge {wj,Vis) with cost pij is constructed if and only if a positive fraction of Xij is packed in 
the bin Vig. The packing of the bins (and construction of the edges) is done in the following 
way. The bins Vn ,..., Ujfc. are packed one by one, with the pieces in the order pn^pa, Pin- 
While Vis is not totally packed, (otherwise we consider the next bin) we continue packing 
the next piece such that if its size, Xij, hts Vis (without causing an overflow) it is packed to 
Vis, else, if it causes an overflow, only a fraction /3 > 0 of Xij is packed to Vis, consuming 
all the remaining volume of Vis, and the remaining part of (1 — f3)xij is packed in 
Figure [2T] gives a pictorial example of this construction. 
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Figure 2.1: Example of the construction in the rounding 


The rounding is done by finding a minimum-cost integer matching M ^ E that matches 
all job nodes, and for every edge {wj, Vis) G M, set a{j) = i i.e., assign job j on machine i. 

By taking a minimum-cost integer matching it is guaranteed that ^ Ylij-a{j)=iPij — 
L, or L{a) < L. By the construction of the graph it is guaranteed that the load on 
machine i, for every i G Ai, is at most max{pij : a{j) = i} + ■' where = 

max{pij : {wj,Vis) G E}. Since ^ ^ detailed proof, see |ST93] j. we get 

that loada{i) < max{pij : a{i) = i} + T for all i G Ai. □ 

This result will be helpful in our approximation algorithm. 

2.2.1 Approximation Algorithm 

Recall that an instance X is fully-feasible if and only if pij < Topt for every job j & J 
and machine i G Ai. Although fully-feasible instances are hard to identify, we give a 
polynomial-time algorithm that yields for such instances an assignment whose makespan 
is better than twice the optimal makespan, the best known ratio for general instances. 

Our main result is as follows. 

Theorem 2.2. There is a polynomial-time algorithm that yields for fully-feasible instances 
an assignment of makespan at most Topt + Lopt- 

Note that always Lopt < Topt. Lopt = Topt occurs only in case where every optimal 
assignment is perfectly balanced (i.e., the load on all the machines equals Topt). Thus, in 
the typical case we get Lopt < Topt, which means that in the typical case our algorithm 
guaranteed a bound that is strictly better that twice the optimum. 

The following theorem shows that the problem remains hard already when considering 
only the class of fully-feasible instances. 
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Theorem 2.3. Makespan minimization on unrelated machines is hard to approximate 
within factor < | , already in the class of fully-feasible instances. 

The proof of the hardness result follows from a reduction from 3-dimensional matching 
as in |LST90j . 

We prove a stronger result, that also shows that our result is robust under small violations 
of the feasibility constraints. Namely, we show that we can get better bounds also when 
our instance is not fully-feasible but has the property that every job can be non-feasible on 
some small fraction of the machines. For this, we will need some dehnitions. 

Definition 2.1. Job j & J is feasible on machine i G M if Pij < Topt 

Definition 2.2. The feasibility parameter of instance X is defined as 

. \{zeM:j is feasible on i]\ 

(p(I) = - 

m 

In these terms, we have that every job j G JT” is feasible on at least 9 ?(X) • m machines. 
Moreover, an instance X is fully-feasible if and only if (p(X) = 1. 


Given an instance X, we can £x L and T to be the minimal values such that LP{T, L) is 
feasible in polynomial-time in the size of the input |X|, using binary search on some feasible 
regions for L and T. At the end of this operation, L and T will satisfy L < T, L < L^pt 
and T < Topt- 


Theorem 2.4. There is a polynomial-time algorithm that, given the values L <T, yields 
for instances X with feasibility parameter (p(X) > ^, an assignment of makespan at most 

^~n _[_ Lopt 

^ opt 


Corollary 2.5. The statement of theorem AJ. 2 follows directly from \2.4 
parameter of fully-feasible instances is 1. 


since the feasibility 


Given a general scheduling instance X, its optimal makespan, Tgpt, nor its feasibility 
parameter, (p{X), cannot be computed in polynomial-time, unless P = NP. However, let 
i be the number of distinct processing times Pij of an instance X and w.l.o.g assume that 
Pi < P 2 < ••• < are the distinct processing times of X. If we denote pi+i = oo, then we see 
that Topt can belong to exactly one of the regions \pt,pt+i) for some t G {1, ...,£}. If Topt G 
[pt,pt+i) for some t G {1, ■■■,£}, then job j G JT” is feasible on some machine i G Ad if and 
only if Pij < pt. Therefore, Topt e [PuPt+i) if and only if Lp{X) = (pt ^ miuj^zj _ 


We prove the theorem by describing an algorithm that admits the desired bound on 
the makespan, for instances with large enough feasibility parameter. The hrst step of the 
algorithm is to hnd the minimal T and L such that LP{T, L) is feasible. Next, it obtains a 
fractional solution to LP{T,L) and rounds it to obtain an initial assignment a, such that 
X(cr) < L and loada{i) < T -\- max{pij : a{j) = z}, Vz G Ad, as in Theorem 12.11 Then, 
for each guess of the optimal makespan region and the corresponding feasibility parameter. 
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it tries to fix the initial assignment to achieve a new assignment of makespan at most 
Topt + (and unless the feasibility parameter is too large, it will succeed for the right 
guess). This is done by balancing the initial assignment such as to reduce the load of the 
overloaded machines to meet the desired makespan. 

We hrst prove the following lemmas. 

Lemma 2.6. Let a he an assignment for some instaneeX = (jf,M) such that L{a) < L, 
let T > L and let ■y > 1. Denote k = \Bad{a,y)\. Then 


2. \Good{(j,y)\ > (^1 - - m + ^ ■ f. 

Proof. Each machine i G Bad{a, 7 ) has load greater than T +y-L, therefore J2ieM load„{i) > 
k-{T + yL). 

1. Assume that k > then 
— 7+1 ’ 


J2km load„{i) > k(T + 7 L) 

>yi(T + lL) 

= + tL) 

>7r(i + 7r) 

= m ■ L 

The last inequality follows from the fact that T > L. Hence, the average machine 
load is greater than L, a contradiction. It follows that k < 

2. Let \Good{a,y)\ = 1. Then, there are m — k — I machines having loads greater than 
yL. Assume that / < m + ^ ■ ^, then 

> k{T + y ■ L) + {m - I - k)yL 
= kT + {m — 1) yL 

> ir + - (^1 - i) m + y f) 7L 

>kT+\y + iy)~,L 

= kT + {mL + kT) 

> mL 

Hence, the average machine load is greater than L, a contradiction. It follows that 
|G'ood(a, 7 )| > (l - m + ^ 

□ 

Lemma 2.7. Let X = (JL, Ad) be an instance with feasibility parameter ip. Let a he an 
assignment for X with average machine load L and let T > L. If p > ^ then for every 
subset A C Bad{a, -), |A^(A)| > \A\, where N (A) is the set of neighbors of A in G^i- 

‘■P ’ p, 
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Proof. Let 
(1 — (^)m, t 


Bad{a, i) 


= k. Since the number of illegal machines for any job j is at most 


le number of good machines for job j is at least the number of good machines 
minus its illegal machines (the worst case where all illegal machines for job j are contained 
in Good{a, ^)). Together with Lemma [2.61 we have 


Goodj{a, i) > Good{a, - (1 - (p)m 


> 1 - 


> k 


m + 


(JJ 


(1 — (p)m 


The last inequality follows from the fact that Now, let A C Bad{a, ^). Then 


1^1 < Bad{a, 


= k. 


Recall that the set of neighbors of A is the set of machines that are good for all the 
jobs i e A, i.e., N (A) = UgAGoodji ^ (a, b) C Good{a,^). Obviously \N {A)\ = 


> 


Goodii 


I -j't I * J 

J max ' ' ip ' 


iV(A)| > k. 

Since 1^41 < k we have that \N (A)! > 1^41. 


3 max 

for some i ^ A. It follows from the above that 


□ 


By Hall’s Theorem |H35] . there exist a perfect matching in G^ i if and only if for every 
A C Bad(a, ^), \N (H)| > |H|. Thus, we have 


Corollary 2.8. There exists a perfect matching in G^i. 

’ V 


By the above discussion, we can modify the initial assignment a, by hnding a perfect 
matching in G^ i and then transferring jobs from bad machines to their matching good 
machines. We describe this formally in algorithm Aum- 
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Algorithm 1 Aum 

1. Use binary search to find the minimal T, such that LP{T,T) is feasible. Next, with 
that T fixed, search for the minimal L such that LP{T, L) is feasible. 

2. Solve the linear relaxation LP{T,L). 

3. Round the solution to obtain an integral assignment a using a rounding technique as 
given in Theorem 12.11 


4. For every t = 1, where i is the number of distinct processing times of X, guess 
that Topt e \pt,Pt+i) and that ip{X) = ipt. 

(a) If ipt < ^, continue. 

(b) Otherwise, construct the bipartite graph j_ and find a perfect matching of 


size 


Bad{a, , if one exists. If not, continue. 


(c) Obtain a resulting assignment a' from a by transferring the longest job, j* 
from each machine i G Bad{a, —), to its matching machine i G Good{a, —). 


5. Return the assignment a' with minimal makespan. 


Proof of Theorem 12.4L We show that the assignment output by Algorithm Aum satisfies 
the statement of the theorem. Consider a general instance X. By performing binary search 
on a feasible bounded region of the optimal makespan we can find the minimal T for which 
LP{T, T) is feasible, and then by performing binary search on a feasible bounded region of 
the optimal average machine load we can find the minimal L for which LP{T, L) is feasible. 
These integers satisfy that T < Topt, L < Lopt and L <T. 

By Theorem 12.11 since LP{T, L) is feasible then Step [3] is guaranteed to generate an 
assignment a of L{a) < L and loada <T + max{pij : a(j) = i}, for all i G M.. 

Let ^ be the feasibility parameter of X. Then for some t = where i is the 

number of distinct processing times in X, ipt = P- Then, by Corollary 12.81 there exists a 


perfect matching in G^ J_ and we will find it in Step 4(6) in Aum- Let k = 

’ vt 

and let M = be a perfect matching in j_. 


Bad{a, 


For any machine i = I, ..., m, loada{i) < T + max{pij : a{j) = i}. Let be the largest 
job processed by a on machine i. Then, = max{pij : a{j) = z}, thus removing 

from machine i guarantees that the new load of machine i will be at most T. 

As for the good machines, if z is a good machine for job j then < pt. Therefore, 
transferring j to z will increase the load of z by at most pt which is at most Topt (since 
Topt £ [Pt,Pt+i))- Since the load of a good machine is at most we have that after such 
a job transfer the load will be at most Topt + 

The load on the rest of the machines stays unchained, i.e., loado{i) < X + ^, for all 
z ^ Bad(a, ■T'j u Good{a, ^). 
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Thus, by performing the large-jobs transfers for all pairs (4^5^ ■^ = 15 we 

obtain a new assignment a , with load^' (i) < max{T, Topt T-|- for alH G M., which 

is at most T^pt + since L < Lopt and T < Topt- □ 

Theorem 2.9. The complexity of Aum is 0{{nm)i ■ log‘^{J2{ij)£MxjP->-j))- 

Proof. We will show that Step 1 in Aum is the bottle-neck of the algorithm. In this step we 
perform a binary search on the feasible regions of T and L while solving LP{T, L). Since the 
feasible region for both T and L is [0, we solve the LP 0{log{Yli(i,j)(zMxjPij)) 

times. Solving the LP can be done in time 0{{nm)i ■ logif^^^ Pij)) |K84j . so overall 

this operation runs in 0((um)i ■ log‘^{Yj{i,j)<EMxjPii))- 

The number of vertices in the bipartite graph j_ equals to the number of bad and 

’ vt 

good machines, which is at most m. From [2^ the number of bad machines is at most 
thus the number of edges in the bipartite graph is at most J2i£Bad(cr ~ — f — I 

(the last two inequalities are due to (p < 1). Therefore finding a perfect matching in j_ 

’ vt 

can be done in time 0{. \V^±_ \ ■ \E^j_\) = 0{y/m) |MV8n] . In Aum, we find a perfect 

V ' vt ’vt 

matching for every t = 1, ..,i, where i < n ■ m is the number of distinct processing times. 
Thus, the complexity of Step 4 sums to 0{nm^/m). 

Step 2 is done in 0((nm)2 ■log{J2{i j)eMxjPij)) and Step 3 is done in 0{nf{m + nY) 

|EK72] . 

Therefore, the overall complexity of the algorithm is 0((nm)2 ■ log"^j)eMxjPij))■ 

□ 
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2.3 A Better Bound for the Restricted Assignment 
Problem 

In this section we consider the restricted version of onr problem, where Pij G {pj, oo}, for 
each job j G and each machine i G M. This subclass is NP-hard to approximate within 
a factor better than |, which is also the best known lower bound for the general version 
[LSTQO] . In the restricted version, any job j with processing time pij < oo is feasible on 
machine i, since pj < Topt for every j G J. Hence, the feasibility parameter of a restricted 
instance X is exactly </?(X) = ^ which can be computed efficiently. Fully- 

feasible restricted instances correspond to the identical machines instances, hence we do 
not consider especially fully-feasible instances in this case. Also, we say that an assignment 
cr : AT —)■ is feasible if pij < oo for every machine i and job j such that a{j) = i. 

For this variant, we show that a better bound than in Theorem 12.41 can be achieved 
by a much simpler and more efficient combinatorial algorithm, and for every feasibility 
parameter. Denote by Pmax = maXj^jPj the largest processing time of some restricted 
instance X. Gairing et al. |GL+04| presented a (2 — ^^A_j_app];oximation algorithm for the 
restricted assignment problem. Using techniques from |GL+04| , we obtain an approxima¬ 
tion algorithm which yields an assignment of makespan at most Pmax + where </9, is 
the feasibility parameter of the instance. 

Overview of the Algorithm of Gairing et al. We describe below the procedure UBJ^, 
used in | GL-f04| . Let X be an instance for the restricted assignment problem. Let A be an 
integer that will be determined by binary search, to be a lower bound on Topt- 

Let (T be a feasible assignment and let = iW U V, E^) be a directed bipartite graph 
where W = {wj ■. j & J} consists of the job nodes, and U = {uj : i G AT} consists of the 
machine nodes. For any job node Wj and any machine node Vi, if a{j) = i there is an arc 
in Eo oriented from Vi to Wj] if a{j) ^ i and j is feasible on machine i, then there is an arc 
in Eo oriented from Wj to Vi- 

Given a feasible assignment cr, consider the partition of machines into three subsets: 
(overloaded), AT “(cr) (underloaded), and AT °(cr) (all the remaining machines). A 
machine i G AT’''(cr) is overloaded if loadoif) > Pmax -|- A -|- 1. A machine i G AT“(cr) is 
underloaded if load^ (*) < A. The remaining machines, which are neither overloaded nor 
underloaded, form the set AT°(cr) = AT \ (AT“(cr) 

The procedure UBE{a, A) starts with some initial feasible assignment of jobs to machines 
and iteratively improves the makespan until it obtains an assignment with makespan of 
Pmax+A, or declares that an assignment of makespan A does not exist. In each iteration, the 
algorithm hnds an augmenting path in Go-, from an overloaded machine to an underloaded 
machine, and pushes jobs along this path, by performing a series of job reassignments 
between machines on that path. This results in balancing the load over the machines, i.e., 
reducing the load of the source that is an overloaded machine, and increasing the load 
of the destination that is an underloaded machine, while preserving the load of all other 
machines. Figure [X2] gives a pictorial example of this operation. 

UBE terminates when there is no path from an overloaded machine to an underloaded 
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machine in G^, and this occurs after 0 {mS) steps, where S = Ylj&j I • Pij < o®} I- Let 
r be the resulting assignment after A) terminates. Then, it is shown in | GL+04| 

that if 7 ^ 0, then Topt > A. 

The procedure lABT combined with a binary search over the possible range of values for 
A, is used to identify the smallest A such that a call to UBTip, A) returns an assignment 
r with = 0. This yields the approximation ratio of 2 — ^ 7 ^- 

The running time of the approximation algorithm is factored by a value that is logarith¬ 
mic in the size of the range in which we search for A, e.g., [0, Thus, the algorithm 

of |GL-|-04| computes an assignment having makespan within a factor of 2 — ^ 7 ^ from the 
optimal in time 0{mSlogP), where P = 


Jobs machines 

o 



Figure 2.2: The bipartite graph Go-. By changing the orientation of the path za js —>■ i^, we 
remove job js from 15 and schedule it on machine 13 . 


2.3.1 Approximation Algorithm 

Let X be an instance of the restricted assignment problem. The feasibility parameter of 
X is exactly cp = (p(X) = ^ ^ pg initial feasible assignment for X, and 

consider the bipartite graph G^. 

Note that any feasible assignment a for an instance of the restricted assignment problem 
has an average machine load 

( 2 . 1 ) 


Thus, Lopt = T J2j&jPj for any instance X. 
Our algorithm proceeds as follows. 
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Algorithm 2 AuEsiX) _ 

1. FixA=L^J. 

2. Apply UBT{o^ A) and return the resulting assignment. 


Theorem 2.10. For any instance X with feasibility parameter 9 ?(X) , Algorithmic yields a 
schedule of makespan at most Pmax + = Pmax + rn^^{x) ^ time 0{m?n). 

Corollary 2.11. LetX, he an instances with feasibility parameter ip{X) = for some d = 

p 

1, Then Algorithm^ yields a schedule of makespan at most Pmax + ^ ■ Therefore, 

for instances with sufficiently large d (which is eguivalent to a large feasibility parameter), 
namely d such that d-pmax > 'Yljej'Pjj 'eoe get a schedule with a makespan better than twice 
the optimal makespan. Let | < r < 2, then for instances with feasibility parameter ^ such 
that d ■ {r — l)pmax > '^j(zjPj, Algorithmic is a r-approximation algorithm. 

The correctness of Theorem 12.101 follows from the next lemma. 

Lemma 2.12. Let X he an instance with feasibility parameter piX). Let a be an initial 
feasible assignment for X. Then [^^J) terminates with = 0. 

Proof. Let r be the assignment when UBT{o, L^^J) terminates. At this point,there is 
no path from a machine in to a machine in AA~{t) in the graph Gr- Assume 

that 7 ^ 0. Then there exists a machine i' with loadr{i') > Pmax + L^^J- Denote 

by A4.il the set of machines i such that Vi is reachable from Vii, then A4ii = {i E M : 
there is a directed path in from Uj/ to Uj }. 

Obviously, there exists a job j', such that T{j') = i'. Thus, there is an edge {vii,Uji) in 
Gr- Since ip is the feasibility parameter of X, there exists at least pm machines on which Ujt 
is feasible, i.e., there exists an edge from uy to each one of these machines. By appending 
each of these edges to {vii,Uji) we get a directed path from n*/ to at least pm machines 
(including vr). Therefore, we conclude that \A4i'\ > pm. 

Now, we compute a lower bound on the average machine load for r, by summing the 
loads of all the machines i G AA. We have that i' G A4^{t), thus loadr{i') > Pmax + 

Also, there is no path from Uj/ to machines in A4~{t), and therefore A4ii fl A4~{t) = 0. 
Thus, for all i G Aiii it holds that loadrii) > + 1- Hence, 

loadr{i) > loadr{i') + loadr{i) 

i€A4 ieA4^i,ipi' 

> Pmax + L^J + 1 + {\Mr\ - 1) ■ (L^J + 1) 

= Pmax + + 1) 

>Pmax + \Aiii\{{^-l) + l) 

>Pmax+mp{^) 

Pmax “1“ "^Lopt 
ttiLqpi 
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We have shown that the sum of loads of the assignment r is greater than mLopt- Hence, 
the average load for r is greater than Lopt- A contradiction. By 12.11 the average load of 
any schedule, and in particular r, cannot exceed Lopt. □ 

Proof of Theorem 12.101 Let X be an instance with feasibility parameter ip. Let a 
be some initial feasible assignment. By Lemma 12.121 when terminates, 

= 0. Therefore, the maximum load of the resulting assignment is at most Pmax + 
~ Pmax + L—running time of the algorithm equals to the running time 
of one call to which is 0 {mS), where S = I ^ Pp < oo} I- Since S < m ■ n 

for every instance X, the algorithm terminates after 0{m‘^n) steps. □ 

Note that our algorithm has better running time than the algorithm of | GL+04 |, since 
we use a single call to the procedure UBJ^, in contrast to the (2-1-)-approximation 

^ Pmax ' 

algorithm of |GL+04| , which uses binary search to hnd the best value for A, resulting in 
an overall running time of 0{mSlogP), where P = 'Yhj^jPj is the sum of processing times 
of all jobs. 
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Chapter 3 

Fixed-Parameter Algorithms for 
Scheduling on Unrelated Machines 

3.1 Preliminaries 

Some problems can be solved exactly, or approximately, by algorithms that are exponential 
only in the size of a hxed parameter while polynomial in the size of the inpnt. Snch 
an (approximation) algorithm is called a fixed-parameter traetable (FPT) (approximation) 
algorithm, because the problem can be (approximately) solved efficiently for small values 
of the hxed parameter. 

Problems in which some parameter k is hxed are called parameterized problems. A 
parameterized problem that allows for such an FPT algorithm is said to be a fixed-parameter 
traetable problem and belongs to the class FPT. 

We give some dehnitions formalizing this concept. 

Definition 3.1. A problem is said to be FPT if it ean be solved by an algorithm A that runs 
in time f{k) ■poly{\X\), for every instanee X with parameter k, and where f is a funetion 
independent of \X\. The algorithm A is ealled an FPT algorithm. 

We can similarly dehne FPT-approximation algorithms. 

Definition 3.2. A problem is said to have an r-FPT approximation algorithm, if there 
exists an r-approximation algorithm A to the problem, that runs in time f{k) ■poly{\X\), 
for every instanee X with parameter k, and where f is a funetion independent of \X\. 

Definition 3.3. A family {Ac : e > 0}, where Ac is a {l-\-e)-FPT approximation algorithm 
for all e > 0, is called a parametrized approximation scheme. 
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3.2 Parametrized Approximation Scheme for Schedul¬ 
ing on Unrelated Machines 

Consider the problem of minimizing the makespan on nnrelated machines, i.e., scheduling 
a set J oin jobs, j = 1,n, on a set Ai of m unrelated parallel machines, i G A4, where 
each job j has a processing time of pij on machine i and the objective is to hnd a schedule 
with minimum makespan. 

Our parameter k of a scheduling instance is the number of machine-job pairs {i,j) ^ 
M X J' such that pij > e ■ T for some e G (0,1] and a feasible value T. We will show that 
by rounding a solution to the MILP formulation of the problem where the k variables Xij 
such that Pij > e-T are integral, we can get an assignment with makespan at most (1 -l- e) 
the optimal makespan. 

Given a positive integer T, let Xij be an indicator to the assignment of job j on machine 
i. Consider the following mixed integer linear program. 


m 


MILP{e,T) : 

'^Xij = 1, 
i=l 

for j = 1, ...,n 


n 

J^PLXij < T, 

Xij > 0, 

^ij ^ 

for i = 1, ...,m 


for i,j such that pij < e - T 
for i,j such that Pij > e - T 


Xij = 0, if Pij > T, for i = 1, ...m , j = 1, ...,n 

One can see that integer solutions to the above MILP are in one to one correspondence 
with assignments a \ J ^ M. of makespan at most T, and that any feasible solution to 
MILP{e,T) has the property that the variables Xij, such that pij > eT, are integral. 


Theorem 3.1. Let X be a scheduling instance, let e G [0,1] and let T be a positive integer 
such that MILP{T,e) is feasible. Then an assignment of makespan at most (1 -|- e)T can 
be found in time 2^ ■ poly{\X\), where k = |{(i, j) : Pp > e • T}|. 


Proof. Let Xij, i = I, j = 1, ...,n be a solution to MILP{e,T). 

Then for every pair {i,j) E Ai x J' such that pij > e ■ T, the corresponding Xij is either 
0 or 1. Let /j be the number of jobs j for which pij > e-T on machine i and such that the 
corresponding variable Xij equals to 1. Recall the rounding technique as in Theorem 12.11 
Then the hrst U slots of machine i are full and therefore the capacity left in the slots for the 
other jobs, withp^j < e-T is 7" —From Theorem 12.11 we get that the jobs that 




are assigned by the rounding to the remaining slots h + 1,..., ki (where ki = ^ 
contribute at most p’A^l + T — Yhj-pijyexPij ^Fe load of machine i. Therefore the load of 


23 






machine i is at most 'Ej:p,^>eTPij + (pmtl +T- 'Zj:p,j>eTPij) = Pmax + T. Now, < 
e • T, and therefore the total load of machine i is at most (1 + e)T. 

The algorithm runs in time 2^ ■ poly{\X\), since the bottle-neck of the algorithm is ob¬ 
taining a feasible solution to the MILP. This is done by brute-force search of at most 2^ 
possible binary values for all the variables Xij corresponding to {i,j) G S^, by hxing them 
and hnding a solution for the resulting LP which can be done in polynomial-time in |X| 
[K84j. 

□ 


3.3 An FPT Algorithm for Graph-Balancing 

In this section we consider the special case of the restricted assignment problem, where 
each job can be assigned to at most two machines, with the same processing time on either 
machine. For this special case, Ebenlendr et al. |EKS08] presented a 1.75-approximation 
algorithm for the minimum makespan problem. 

An instance of the scheduling problem can be modeled as an undirected, multi-graph, 
with m nodes (a node for each machine) and n edges, such that every job j is associated 
with an edge of weight pj between both machine nodes on which it can be processed, or a 
loop of weight pj on the only machine node on which it can be processed. Minimizing the 
makespan is then equivalent to the problem of graph-balancing, i.e., of orienting each edge, 
such that the maximum weighted in-degree over all nodes is minimized. We exploit this 
graph representation of the problem to develop an FPT algorithm for this case. 

Definition 3.4. The maximum degree r of an undirected graph G = (V, E) is the maximum 
number of neighbors of any vertex, i.e., r = |A^(u)|, where N{v) = {e G E : 

vu,for some u G E}. 

We give below an FPT algorithm for graph balancing, where the parameters are the 
width of the tree decomposition of the graph, and the maximum degree of the graph. 

Note that we cannot hope for obtaining an FPT algorithm with the hxed parameter being 
only the maximum degree of the graph, as from the hardness proof for general instance 
[EKS08] , if follows that he problem is hard to approximate within a factor less than | even 
on bounded degree graphs, i.e., when the maximum degree is some constant. 

Intuitively, a tree decomposition represents the vertices of a given graph G as subtrees 
of a tree, in such a way that vertices in the given graph are adjacent only when the 
corresponding subtrees intersect. 

Definition 3.5. Given a graph G = {V,E), a tree decomposition is a pair {X,T), where 
X = {Xi, ...,Xt} is a family of subsets ofV (also called bags), and T is a tree whose nodes 
are the subsets Xt, satisfying the following properties: 

1. The union of all sets X, eguals V. That is, each graph vertex is associated with at 
least one tree node. 
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2. For every edge {v,w) in the graph, there is a subset Xi that contains both v and w. 
That is, vertices are adjacent in the graph only when the corresponding subtrees have 
a node in common. 

3. If Xi and Xj both contain a vertex v, then all nodes Xk of the tree in the (unigue) 
path between Xi and Xj contain v as well. That is, the nodes associated with vertex 
V form a connected subset of T. It can be stated eguivalently that if Xi, Xj and X^ 
are nodes, and X^ is on the path from Xi to Xj, then Xi n Xj C X^. 

The width of a tree decomposition is the size of its largest set X, minus one. The 
treewidth tw{G) of a graph G is the minimum width among all possible tree decompositions 
of G. 

It is observed in |B881 IN06] . that many algorithmic problems that are NP-complete for 
arbitrary graphs can be solved efficiently by dynamic programming for graphs of bounded 
treewidth, using the tree decompositions of these graphs. We show that the problem of 
graph balancing can be solved efficiently by dynamic programming for graphs of bounded 
treewidth and bounded vertex degree (the maximum number of neighbors of a vertex). 

Theorem 3.2. Let G be an undirected edge weighted multi-graph, with given tree decompo¬ 
sition ({X = {Xi, ...,Xt},T) of width w. Then the graph balancing problem parameterized 
by the graph treewidth and the maximum degree, r, is solvable in time 0(2^^'^ ■ wr ■ |X|. 

Proof. We show how the problem can be solved using dynamic programing on the tree 
decomposition of G. The idea is to examine for each bag X* G X all the possibilities of 
feasible assignments to the machines represented by the vertices in the bag X*, and the 
jobs represented by the edges of the subgraph G[Xj], induced by the vertices in bag X*. 
This information is stored in a table Bi corresponding to each bag Xj. The tables will 
be updated in a post-order manner, starting at the leaves of the tree decomposition and 
ending at the root. During this update process, it is guaranteed that local solutions for 
each subgraph corresponding to a bag of the tree decomposition are combined into a global 
optimal solution for the overall graph G. 

The algorithmic details are as follows. 

Step 0: Set an initial orientation on the edges of the graph G. For each bag Xj = 
{uj,...,u;j, |Xj| = Uj, let E[Xi] = {e{,...,el^J, \EG[Xi]\ = rui . Compute the following 
table of 2™* rows, and ruj -|- Uj -|- 1 columns: 
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The table consists of 2"'*^ rows and n^r + 1 columns. Each row represents an assignment 
to the sub-problem induced by the subgraph G[Xj]. Each row is a 0-1 sequence of length 
UiT that determines which of the edges in G[Xj] is directed oppositely than its direction in 
the given initial orientation (1 if it is the opposite orientation, and 0 otherwise). Formally, 
we can describe an assignment by a mapping 

Ai : E[X^ = ..., ^ { 0 , 1 }. 

Given the mapping Aj, let In{vj) denote the set of incoming edges for Vj, for Vj G Xj. The 
last column, Tj(), is the makespan of the assignment A^. 


Step 1 : Table initialization. 

For every table Bi and assignment Ai : E[Xi\ —)■ {0,1} set 

e£ln{vj) 


for every G Xj, where c(e) is the cost of edge e, which corresponds to the 

processing time of the job associated with the edge e. 


Step 2: Dynamic programming. 

We now go through the tree decomposition of G, from the leaves to the root, and compare 
the corresponding tables against each other. Let i be the parent node of j. We show how the 
table for Xj can be updated by the table for Xj. Assume that Xj = {-ui, ..,Us, nj,..., 
and Xj = and that E[Xi] = {ci,.., Cj, ej,..., and E[Xj\ = 

{Ci, .., Ct, 6]^, ..., €,rnj—t\- 

For each assignment A : {ei,...,et} {0,1}, and each extension Ai : E[Xi\ —)■ {0,1} 

of A, we consider an assignment Aj, which is an extension of A, that minimizes the new 
makespan, i.e., for each assignment Aj, which is an extension of A, we calculate 

e£E[Xi]r\E[Xj]AeGln{uk) 


for k = 1,s. Then, we calculate the makespan 

T/ = max [Tj{Aj), maxi<k<sli{uk), maxi<k<ni-sk{vl)] 

We update the entry for A^ with Ij and T- for j = argmin{T-). 

The values of li{v), v G Xj, and Tj() grows by the minimal value for the makespan of 
the assignment problem induced by all the vertices contained in the subtree rooted at node 
i. If i has several children ji ,..., j), then table Bi is updated successively against all tables 
Bj -^,..., Bj^ in the same way. All this is repeated until the root node is finally updated. 
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Step 3 : Construction of a minimum makespan assignment. 

The length of a minimum makespan assignment is derived from the minimal entry of 
the last column, Tj(), of the root node table. The assignment of the corresponding row 
shows where to assign the jobs represented by the edges in the subgraph induced by the 
vertices of the root bag. By recording in Step 2 how the respective minimum of each bag 
was determined by its children, one can easily determine the assignment of all edges in the 
graph. 

This concludes the description of the dynamic programming algorithm. It remains to 
show its correctness and running time. 

1. The hrst and second conditions in Definition 13.51 namely V = Ux^i and Ve G 
E 3Xi E X : e E E[Xi], guarantee that every machine and job in the instance is 
considered through the computation. 

2. The third condition in Dehnition 13.51 guarantees the consistency of the dynamic pro¬ 
gramming. If a vertex v E V occurs in two different bags and Xjj, then it is 
guaranteed that for the computed minimum makespan assignment only one set of 
jobs can be scheduled on the machine associated with that vertex v. 

As for the running time of the algorithm, for each edge (X,, Xj) in the tree decomposition 
of G, and for each of 2™* assignments A*, we go over all the assignments Aj that agree with 
Ai on the edges in E[Xi] p[E[Xj\ (at most 2"^^'), and do a computation of time Oijii -t-m,). 
This results in complexity of 0(2™'*’''™-’') • (n* -|- rrii) ■ |X|. Since < Uk ■ r and Uk <w for 
all Xfc G X, we have that the complexity of the algorithm is 0(2^^^ ■ wr • |X|). □ 
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Chapter 4 

Reoptimization Algorithms for 
Scheduling Problems 

4.1 Preliminaries 

Let !!/£) and denote the makespan minimization problems on identical and uniform 
machines, respectively. In the reoptimization model developed in [STT12] . we consider 
instances of the scheduling problem that can change dynamically over time. Our goal 
is to compute assignments within some guaranteed approximation for the new problem 
instances, derived from the previous instances. Since the transition from one assignment 
to another incurs some cost (for example, the cost of pausing the execution of a process 
on one machine and resuming its execution on another), an additional goal is to have the 
solution for the new instance close to the original one (under a certain distance measure). 

Let Xq = (Afo, Jo) be an instance of jobs and machines. Let uiq = |Alo| and uq = |j7o| 
and let ctq : Jo A4o be some initial assignment for Xq. We denote by X = (A4,J) a 
new instance derived from Xq by an admissible operation, e.g., addition or removal of jobs 
and/or machines. For any job j & J and a feasible assignment a \ J ^ A4, we are given 
the transition cost of j when moving from the initial assignment ag to a. We denote this 
transition cost by Ccrg(j, cr). The goal is to hnd an optimal assignment for X, for which the 
total transition cost, given by Xljej'is minimized. 

Recall that, given an optimization problem If, we denote by R(n) the reoptimization 
version of If. 

Definition 4.1. An algorithm A yields an {r, p)-reapproximation for RiliuJ) (Rijiuiq)), 
for r, p > 1, if for any instance X for (YIun), A outputs an assignment of makespan 
at most p times the minimal makespan for X, and of total transition cost at most r times 
the minimal transition cost to an optimal assignment for X. 

We consider below the case where transition costs can take values in {0,1}. In particular, 
job j € iX incurs a unit transition cost either if (i) f G Jo and is moved to a different machine 
in the schedule for /, or [ii) j ^ J \ Jo, i.e., j is assigned to a machine for the hrst time 
in the schedule for X. Otherwise, the transition cost for job j is equal to 0. Formally, 
c<To(i, cr) = 1 if j ^ Jo, or if j G Jo and cro(j) ^ a{j); otherwise, a) = 0. 
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Definition 4.2. A polynomial time reapproximation scheme (PTRS) for -R (H) is an al¬ 
gorithm that, given the inputs Xq and X for -R(n) and parameters €1,62 > 0 , yields a 
(1 + ei, 1 + € 2 )-reapproximation for R (11), in time polynomial in \Xq\ and \X\. 


4.2 A (1, l+e)-Reapproximation Algorithm for Makespan 
Minimization on Identical Machines 

We present below a reapproximation algorithm, Aid^ for the problem of minimizing the 
makespan on identical machines. The algorithm uses a relaxed packing of items in bins, 
where the items correspond to jobs, and the bins represent the set of machines. 


Definition 4.3. Given a set of bins, each of capacity K > t], and a set of items packed 
in the bins, we say that the packing is e- relaxed, for some e > 0, if the total size of items 
assigned to each bin is at most (1 + e)K. 


4.2.1 Algorithm 


Our algorithm for reoptimizaing makespan minimization on identical machines accepts as 
input the instances Xq and X, the initial assignment of jobs to the machines, Uo, and an 
error parameter e > 0. The algorithm proceeds as follows. 


We apply a (1+ e)-approximation algorithm [H96l|AA+98| on the new instance to obtain 
a solution of makespan T < (1 + Then, we split our instance into large and small 

jobs, round down the large jobs processing times (to have a polynomial-size collection of 
feasible configurations of large jobs on the machines), such that the load of each machine 
does not exceed T. Then, we iterate on this collection in order to find the configuration 
that minimizes the transition cost from the original solution. We prove that after we inflate 
the rounded jobs to their original processing times, and greedily assign all the small jobs 
within the configuration, the resulting makespan is at most (1 -|- 


We give below a detailed description of our algorithm. Aid- Let Cf^^„,(X) denote the 
minimum makespan for an instance X. For simplicity of the presentation, for the case 
where m < mo, we assume w.l.o.g. that the omitted machines are m -|- 1, m -|- 2,..., mo. 
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Algorithm 3 Aid{^oi'Ii0^o) 

1. Let eo = |. Use a PTAS for makespan minimization on identical machines to find 

U < (1 + eo)U^a^(X). 

2. Define aj = ^ for all j G JL, and represent each machine as a bin of unit capacity. 
Consider the jobs as items whose sizes are aj G (0,1]. 

3. An item j G JT” is small if it has a size at most Cq; otherwise, item j is large. 

4. Round down the sizes of the large items to the nearest multiple of Cq. Denote the 
rounded sizes for every large item j. 

5. For any feasible assignment of rounded large items in the m bins, given by the con- 
hguration C ={C^,..., C™}, do: 

(i) Let £ = max{mo,m}. Construct a complete bipartite graph G = {U,V,E), in 

which V = and U = {C^, ..., C^j. Each vertex i e V corresponds 

to the initial conhguration of machine i, given by Cq = {j G JTq : o'o(j) = i}, for 
1 < i < mo; if mo < m , set Cg = 0 for all mo + l < i < i. If mo > m, set C* = 0 
for all m + 1 < i < i. Dehne a cost on the edges (i, C^), for all 1 < i, A; < i, as 
follows. 

(a) Add the cost of large items that appear in but not in the initial conhg¬ 
uration Cg. 

(b) Add to C^ all the small items that appear in Cg but not in C^ and then 
omit the largest small items until the total size of C^ does not exceed 1. 
Add the cost of the omitted items. For an empty conhguration C^, the cost 
of the edge {i, C^), for all 1 < i < £ is equal to 0. 

(ii) Find a minimum cost perfect matching in the bipartite graph. 

(hi) Add to the solution the omitted small items using First-Fit. 

6. Choose the solution of minimum cost, and return the corresponding schedule of the 
jobs on the machines. 


4.2.2 Analysis 

Theorem 4.1. For any e > 0, Algorithm Am yields in polynomial time a (1,1 + e)- 
reapproximation for R (LI/x)), 

We prove the theorem using the next lemmas. 

Lemma 4.2. Let X = (Al, J) be an instance ofUjo, for which the minimum makespan is 

Let for all j G J; then, Y.jej W - 
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■ P' 

Proof. We note that the minimum makespan satishes ^ ■ Since T > we 

have that a.j = ^ < for all j G JT”, therefore, 

^ ^max 

jej jej 

□ 

Lemma 4.3. Let C be a feasible assignment of rounded large items on the m machines, 
given by the configuration C = to which we add in each bin the small items 

that were not omitted in Step 5 of Aid- Then C can be expanded in polynomial time to an 
Co-relaxed packing of all items in the input X. 

Proof. Let C = C^} be a feasible conhguration of the large rounded items, and 

let S‘^,..., S'^ be the subsets of small items added in Step 5 to the bins, to form C. Then 
C yields a feasible packing, i.e., for all 1 < i < m, 

jec» jes» 


Now, since aj > eo for all j G C\ the number of large items in bin i is bounded by [1/eoJ. 
Also, since aj — otj < Cq, for all j E L, we have that 


+‘'o)+^ + 

jec*us* ieC' jes^ jec^ 



+ ^ < 1 + Cq. 

jesi 


Hence, the packing of C is Co-relaxed. Now, we show that the packing remains eg-relaxed 
after we add the small items using First-Fit. Let ai, a 2 , denote the sizes of the items 
packed in C, and let at+i-, ctn be the sizes of the small unpacked items. By Lemma [4.21 
we have 

n 

(4.1) 

i=l 

We apply First-Fit in the following relaxed manner. Consider the next item in the unpacked 
list. Starting from bin 1, we seek the hrst bin in which the item can be added, such that 
the overall size of the items packed in this bin is at most 1 -|- Cq. Let be the total size 
of the items packed in bin i after adding the small items. Assume that, after we apply 
First-Fit, some small items remain unpacked (i.e., none of the bins can accommodate these 
items). Let ii, ...fin denote this subset of items. Then, we have that (1 -|- cq) — Vi < aj for 
all j = £,..., n and i = 1,..., m. It follows, that 

m 

(1 -I- co)m — Vi < maj Wj = i,... ,n (4.2) 

i=l 

By the dehnition of Uj, and since we packed all items up to Z£_i, 

m I—I 

i=l j=l 


(4.3) 
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From fl4.1l) . we get that 


n 


l-l 



i=i 




Then, from fl4.3p . we have 

n m 

m-^aj > 

j=£ i=l 

From (14.211 and (14.4p. it follows that, for all i<j<n, 

n 

m(l + eo) - (m - ^ aj) < m{l + eo) 
j=e 


m 

Tj < maj, 

i=l 


(4.4) 


or, 

n 

meo + oij < muj. (4.5) 

j=£ 

n 

From (14.5p . and since < eo for j = we have that meo + X) thns, 

j=£ 

n 

<0- ^ contradiction, since aj > 0 for all j. 

j=e 

Hence, the above relaxed implementation of First-Fit packs all of the remaining small 
items and yields a relaxed-packing of the original instance. □ 

Lemma 4.4. Let OPT be an optimal solution, and letC = {C ^,..., C'™} he the configuration 
of large items derived from OPT. Then, the cost of this optimal solution is at least the cost 
of the solution for C in Step 5 of Aid- 

Proof. C is an optimal confignration, therefore C is also a feasible confignration (since 
— ^)- ALGc be the solntion for C the algorithm generates in Step 5. 

j£Ci 

Assnme that Cost {OPT) < Cost {ALGc)- Note that the cost for the large items is the 
same in both solntions, and therefore the difference between the cost is caused by packing 
small items. We also note that, in the algorithm, the small items we pay for are those that 
are omitted from their original bin and transfered to a different one. Since the transition 
costs are 1, it means that the number of omitted small items in OPT is smaller than in 
ALGc, or in other words, the number of small items that are packed in their original bin in 
OPT is greater than their number in ALGc- Therefore, there must exist a bin 1 < i < m 
for which this holds. Consider the small items packed in bin i in each solution. Denote by 
qi, ...,qs the sizes of small original items of bin i that are packed in bin i in both solutions, 
by Pi, and pf, ...,pP two distinct sets of small original items of bin i that are packed 
in ALGc and in OPT, respectively, but not in both. Since the algorithm chooses to omit 
the largest small items first, the fact it chose to omit pf,...,pf but to keep pf,...,p^ means 
that pf, ...,pP are not smaller than pP, ...,p^. In particular, let pf = min {pf, ...,pp}, then 
we have 

Pi = min {pP, ...,pP} > max {pf,-,Pk} - (4.6) 
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Since < T, we get that 






T 

j&C'- i=i i=i 

The algorithm chose, in particular, to omit pf, hence 

s k 

+ Y.pf <1+^0 

j€Ci j = l j=l 

but also 

s k 

PJ + 5^ Qj + + P? > 1 + eo- 

iec* i=i i=i 

By the above discussion, we also have that 

E Pi + E ?i + Ef=iP/ + Pf ^ E Pi + E ?i + ^P? + P? 


iec* 


i=i 


14.61 iec* 


< 

fc<£ 

< 


i=i 

E Pi + E ?i 

iec* i=i 




(4.7) 


(4.8) 


(4,9) 


EPj + E® + Ej.iPy 

14.61 iec* j=i 

< 1 

03 

From the last inequality, we have a contradiction to fl4.9p . Hence, we have the statement 
of the lemma. □ 


Lemma 4.5. \EK7^ Let G = {V,U,E) be a bipartite graph with \V\ = \U\ = n and, and 
let c \ E ^ M. be a east function on the edges. A minimum cost perfect matching, i.e. a 
perfect matching M <Z E for which EeeM'^(^) minimized, can be found in 0{n^) time. 


Now, we are ready to prove the main theorem. 


Proof of Theorem 14.11 Let C = C^,..., be the conhguration of large items derived 
from an optimal solution, OPT. Let ALGc be the packing obtained for C in Step 5 of the 
algorithm. By Lemma 14.41 we have 

Gost (ALGc) < Gost {OPT). (4.10) 

Let ALG be the solution the algorithm outputs. 

Since Gost{ALG) = min {Gost{ALGc) : C is a legal configuration }, we have that Gost{ALG) < 
Gost{OPT). 

Now, it is easy to see that if we transform ALG to the schedule of the job on the 
machines (by returning to the original processing times), we get a solution of makespan 
at most (1 + eo)T. Since T < (1 + eo)Gf^,^„,, the algorithm is a (1 + cq)^- approximation 
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algorithm. We note that (1 + eo)^ = (1 + |)^ = (1 + | + -j^) < 1 + e, thus the algorithm 
yields a (1 + e)-approximation to 

We conclude that Aid is a (1,1 + e)-reapproximation algorithm for the reoptimization 
problem. 


Now, we prove that Aid runs in polynomial time. 


In the first step, we run a PTAS for the new instance X, therefore, this part is 
polynomial in X (but exponential in A). 

Steps 2,3 and 4 are clearly polynomial in |X|. 

For Step 5, we hrst show that the number of feasible conhguration of large rounded 
items is polynomial in X. Recall that the number of large rounded items in each 
bin can not exceed A Since the rounded sizes of large items can be of at most 

td- 


different sizes, the number of feasible conhgurations is at most R = ( - 1 '° )• 


There are at most m bins, implying that the number of feasible configurations is at 
j which is polynomial in m but exponential in X. Now, for each con- 


most 


figuration, we construct the bipartite graph in time polynomial in |X| and by |EK72] 
we find a perfect minimum cost matching in time O(k^) where k = max [m,m }, 
which is polynomial in |X|. 

Packing of the small items in each iteration using First-Fit is also done in time 
polynomial in |X| and Step 6 is clearly polynomial in |X|. 
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4.3 A (1, l+e)-Reapproximation Algorithm for Makespan 
Minimization on Uniform Machines 

We present below a reapproximation algorithm, Aun, for the problem of minimizing the 
makespan on uniform machines. The algorithm uses a relaxed packing of items in bins, 
where the items correspond to jobs, and the bins represent the set of machines. 

Definition 4.4. Given a set of m bins, with positive capacities Ki, K 2 ,K^, and a set 
of items packed in the bins, we say that the packing is e- relaxed, for some e > t), if for 
every 1 < i < m, the total size of items assigned to each bin is at most (1 + e)Ki. 


4.3.1 Algorithm 

Our algorithm for reoptimizing the makespan on uniform machines accepts as input the in¬ 
stances Xq and X, the initial assignment of jobs to the machines, (Tq, and an error parameter 
e > 0. 

In the previous section, it was convenient to convert the problem into a bin packing 
problem where all bins have equal size. We will consider the conversion to a bin packing 
problem also in the case of scheduling on unrelated machines, only here the bins will have 
variable sizes. In the generalization of the equal-size bin case to the variable-size case, 
we come across a major obstacle. The size of the subintervals in which the pieces are 
partitioned depends on the size of the bins in which the pieces are to be packed. Moreover, 
the definition of large and small pieces depends on the size of the bin in which the pieces 
are to be packed. Hochbaum and Shmoys |HS88] presented a PTAS for the problem of 
makespan minimization on uniform machines. They constructed, in polynomial time, a 
layered directed graph, with two nodes designated “initial” and “success”, such that there 
exists a path from “initial” to “success” in the graph if and only if there is a schedule with 
makespan at most 1 -1- e times the minimal makespan. From this path one can also dehne 
the configuration of “medium” and “large” jobs on each machine, and it is guaranteed that 
the “small” jobs can be scheduled. We add suitable costs to the edges of the layered graph 
and show that a feasible solution with optimal cost can be obtained by finding the lightest 
path from “initial” to “success” in the layered graph. It is also guaranteed that if there is 
a path from “initial” to “success” in the graph, then there is enough space in the bins to 
add the remaining pieces (e.g., using First-Fit), in the bins on which they are considered 
small. This way, as before, we can greedily pack the small pieces in their original bin (as 
long as we do not exceed its capacity), and pay only for packing the rest. 


HS88 


Overview of the PTAS of 

each machine i as a bin of size s 


Assume that Si is the highest speed, and represent 
Normalize the bin sizes by Si. Consider the jobs as 


items whose sizes are in (0,1] (by normalizing them by some upper bound on the minimal 
makespan). Round down piece sizes in to the nearest multiple of for some 

integer k > 0. 


For a bin of size s, G fe 


,k+l 




define: 
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• Pieces of sizes in are large for the bin. 

• Pieces of sizes in are medium for the bin. 

• Pieces of sizes less than or equal to are small for the bin. 

For convenience, the interval is referred as interval k. For pieces in interval k, 

a bin is large if it is in interval /c, huge if it is in interval k — 1, and enormous if it is in 
intervals 0,..., fc — 2. 

A directed layered graph, Gl = is then constructed, in which each node is 

labeled with a state vector describing the remaining pieces to be packed as large or medium 
pieces. The graph is grouped into stages, where a stage specihes the large and medium pack 
of bins in one interval Each layer within a stage corresponds to packing a bin 

in the corresponding interval. Both the bins within the stage and the stages are arranged 
in order of decreasing bin size. The state vector associated with each node is of the form 
{L; M]Vi,V 2 ,V), where L and M are vectors, each describing a distribution of pieces in 
the subintervals of and respectively. There are two nodes designated 

“initial” and “success”, such that “initial” is connected to the initial state vectors of the 
hrst stage, and every hnal state vector of the hnal stage is connected to “success”. A path 
from “initial” to “success” in specifies a packing of the rounded medium and large 
pieces for every bin. 

We note that, after the large and medium pack of the bins in interval k, we must allow 
for the packing of the remaining pieces in interval k that will be packed as small. These 
pieces must be packed in enormous bins for them, therefore, we need to have sufficient 
unused capacity in the enormous bins to at least contain the total size of these unpacked 
pieces. This is represented by the value Vi in the state vector; it records the slack, or unused 
capacity in the partial packing of the enormous bins with large and medium pieces. For 
stages corresponding to intervals greater than k, we also need to have the unused capacity 
in the huge and large bins, and this is the role of V 2 and V, respectively. Since we must 
represent the possible values in some compact way, we consider the sizes of the pieces that 
will be packed as small pieces into this as-yet-unused capacity. For Vi, pieces in interval 
k are small, and thus all pieces to be packed into this unused capacity have rounded sizes 
that are multiples of as a result, it will be sufficient to represent Vi as an integer 
multiple of Similarly, V 2 and V will be represented as integer multiples of and 
^A:+4, respectively. 

The following lemmas, due to |HS88] . will be useful in analyzing our algorithm. 

Lemma 4.6. Given e > 0, the layered graph Gl has 0{2m ■ . 1/e®) nodes and 

0{2m{n/edges. The number of nodes in each layer, which is the number of state 
vectors corresponding to packing the bins of that layer, is 0{nfl’" {n/e^Y). 

Lemma 4.7. For any e > 0, there is a one to one correspondence between paths from 
“initial” to “success” in the layered graph Gl and e-relaxed packings. 

Lemma 4.8. Given an e-relaxed packing of rounded piece sizes, for some e > 0, restoring 
the piece sizes to their original size yields a (2e + e^)-relaxed packing of the original pieces. 
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We give below a detailed description of our algorithm, Aun- Let denote the 

minimum makespan for an instance X. 


Algorithm 4 At/Ar(Xo,X, (Tq) 

1. Let e > 0. Use a PTAS for makespan minimization on uniform machines to hnd 

r < (1 + 

2. Let Si be the largest speed, and assume that ^ is bounded from above by some 

given constant h>l. Normalize all the processing times by si -T, and represent each 
machine as a bin of capacity ^ < 1. Consider the jobs now as pieces with sizes in 
(0,1]. Denote the new bin sizes by | < < ••• < Si = 1, and denote the 

piece sizes by pi,p 2 , 

3. Round down the piece sizes pj G to the nearest multiple of 

4. Construct the directed layered graph and dehne edge costs as follows. Consider 

the edge connecting a node of the (£ — l)’th layer to a node of the f'hh layer in 
some stage that corresponds to interval k. describes the large and medium pack of 
the £’th bin of stage k. Let be the set of pieces packed in that bin in the initial 
solution. Fix the cost on that edge to be the number of large and medium pieces 
that appear in Gq^, but are not packed in the bin by e^. All other edges in Gl are 
assigned the cost zero. 

5. For every choice of exactly one update arc, at the end of each stage, do: 

(i.) Find the lightest path from “initial” to “success” in the graph (if one exists). 
Dehne the corresponding partial solution consisting of pieces that are packed as 
large or medium. 

(ii.) Add the remaining pieces greedily to the enormous bins for them: for each bin 
i, let S'oj be the set of remaining pieces that belong to the bin in the initial 
solution and that are small for this bin. Start packing the pieces in Soi in bin i, 
in increasing order of piece sizes, and stop after the total size of packed pieces 
exceeds for the hrst time the bin capacity. Pack all the remaining pieces, as 
small, using First-Fit. 

6. Return the solution of minimum cost. 


4.3.2 Analysis 

Theorem 4.9. For any e > 0, Algorithm Aun yields in polynomial time a (1,1 -l- e)- 
reapproximation for R (fljjAr). 

We prove the theorem using the next lemmas. 
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Lemma 4.10. Given the partial packing of medium and large pieces (after Step 5(i) in 
Aun), packing the remaining pieces greedily as small, in Step 5(ii), incurs the minimal 
cost for packing these pieces. 

Proof. For the partial packing of medium and large pieces, let R be the set of the remaining 
pieces. We show that packing the pieces in R in enormous bins for them, by the greedy 
algorithm given in Step 5(ii), results in the minimal cost for packing R. The greedy 
algorithm hrst packs every bin with its original pieces that are small. It sorts them in non¬ 
decreasing order by piece size and packs them to the original bin by this order, until the 
bin capacity is exceeded for the hrst time (or all pieces are packed). This way we guarantee 
that we pack to every bin the maximal number of small pieces that were originally packed 
in it. Thus, the number of pieces, packed as small, not in their original bin, is minimized. 
This also implies a minimum packing cost for R. □ 

Lemma 4.11. Let OPT be an optimal solution. Then, the cost of OPT is at least the cost 
of the solution obtained by Aun- 

Proof. Let C = {C^, ..., be the conhguration of large and medium items derived from 
OPT. Since C is optimal, in particular, it is a truly feasible conhguration of large and 
medium pieces that also leaves enough slackness for packing the remaining pieces as small, 
without exceeding the capacity of the bins. Thus, by Lemma 14.71 there is a path from 
“initial” to “success” in the layered graph, such that C is the large and medium pack 
derived from it. 

This path also contains exactly one update arc after each stage. Therefore, our algorithm, 
in particular, considers this choice of update arcs for which it hnds a lightest path in the 
graph (which clearly exists for this choice), by Lemma [4. 101 the cost of the solution output 
by the algorithm is at most the cost of the lightest path plus the minimal cost of packing 
all the remaining pieces as small, which is clearly at most the cost of OPT. □ 

Lemma 4.12. For any partial solution of large and medium rounded pieces, derived from 
a path from “initial” to “success” in Gu, all the remaining rounded pieces can be packed 
in enormous bins for them, using the greedy algorithm in Step 5(ii), such that the load on 
each bin of size Si is at most 5,(1 -|- e -|- e^). 

Proof. As shown in |HS88] . the small-pack phase, which is done after every stage, is always 
successfully completed, since there is an update arc if and only if there is sufficient total 
slack to accommodate all pieces to be packed as small. Using a similar argument, Aun is 
able to pack all the remaining small pieces after packing all the large and medium pieces. 
Consider the rounded pieces packed into a bin of size Sj, which is in interval k. Focus on 
the small piece j that, when added to the bin, exhausts the usable slack. Then, piece j 
is of size less than or equal to and before piece j was added, the rounded piece sizes 
did not exceed s, -|- (recall that the usable slack is rounded up to a multiple of 
Hence, the bin contains pieces of total (rounded) size at most s, -|- Therefore, 

if Bi is the set of pieces packed in bin i, then the sum of the rounded piece sizes in Bi is at 
most Si + which is at most s, -|- es, -f e^s, = s,(l -|- e -|- e^), since s, > . □ 
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Lemma 4.13. The number of different choices for update arcs, one after each stage, is at 
most 

{n/e^YY), where S is the number of stages in the graph. 

Proof. Between every two stages, there are [nje^Y) update arcs, same as the number 

of state vectors in each layer (Lemma ll.hh . Therefore, the number of possibilities for 
choosing one arc in each stage is (n/e^)^)'^). □ 

Lemma 4.14. The number of stages S in Gl depends only on e and on b. 

Proof. The number of stages in Gl is the number of intervals k that contain at least one 
bin. Since the smallest bin size satishes Sm > *5' is at most min{k G N : < il- ^ 

Now, we are ready to prove the main theorem. 

Proof of Theorem 14.9t By Lemma 14.111 taking the best solution among all lightest 
paths (one for each choice of update arcs), we have a solution whose transition cost is at 
most the transition cost of an optimal solution. By Lemma 14.121 this solution is also an 
(e + e^)-relaxed packing of the rounded pieces, and by Lemma [4.81 after inflating the pieces 
to their original sizes, we get a (2(e + e^) + (e + e^)^)-relaxed packing. Hence, by fixing 
a suitable initial e, e.g., e = eo/8, we conclude that Aun is a (1,1 + eo)-reapproximation 
algorithm for our reoptimization problem. 

We note that Aun runs in polynomial time. Constructing the graph Gl = {Vl,El) 
can be done in time linear in the graph size, which is Vl = 0 {2m ■ r?!'^ • 1/e®) and 

El = Oifl'nilnj i^^), by Lemma 14.61 By Lemma 14.131 the number of different 
choices of update arcs, is X = Oifnf!'" (n/e^)®)'^), where S is the number of stages in 
the graph. By Lemma 14.141 S depends only on e and b. Therefore, the complexity 
of running e.g., Dijkstra’s algorithm |D59] for hnding a lightest path from “initial” to 
“success” and then greedily packing the small pieces, for every choice of update arcs, is 
0{X ■{Dijkstra{GL)+Greedy{n, m))). Overall, we get that the complexity of the algorithm 
is 0(X ■ {Dijkstra{GL) + Greedy{n,m))), which is polynomial in the input size. 
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Chapter 5 

Conclusions and Future Work 


5.1 Summary 

In this work we studied the fundamental problem of makespan minimization on parallel 
machines. For the unrelated machines model, we derived an improved bound, which de¬ 
pends on the minimum average machine load and the feasibility parameter of the instance. 
Our bound is strictly smaller than 2, the best known general upper bound, for a natural 
subclass of instances. 

We further studied the power of parameterization and presented an FPT algorithm 
and a parameterized approximation scheme for instances that are known to be hard to 
approximate within factor |, based on classical complexity theory. 

Finally, we initiated the study of the reoptimization variants of makespan minimization 
on identical and uniform machines, and derived approximation ratios that match the best 
known ratios in these models. 


5.2 Future Work 

Our study leaves open several avenues for future research. 

Makespan minimization on unrelated machines While reducing the gap between 
the general lower bound of | and the upper bound of 2 remains a prominent open problem, 
it would be interesting to tighten these bounds for other non-trivial subclasses of instances, 
either by a constant, or by a function of the input parameters. 

Another interesting direction is to study instances with a wider range of feasibility pa¬ 
rameters. In this work, we explored instances having large feasibility parameter. On the 
other hand, Ebenlendr et ah |EKS08j considered instances in which the feasibility param¬ 
eter is at most Any results on other, intermediate values of this parameter, would shed 
more light on its role in obtaining better schedules. 

Other natural questions are: “Can we obtain better bound for fully-feasible instances, 
when Topt = LopiT' “Can we derive a better lower bound for general instances for which 
T — T ?” 

opt — ^opt • 
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FPT algorithms for scheduling The problem of minimizing the makespan on nnre- 
lated machines is NP-hard even if the nnmber of machines or nnmber of jobs is taken as a 
parameter jLSTQOj . therefore no FPT algorithm exists with only these choices of parame¬ 
ters. In addition, we cannot hope for obtaining an FPT algorithm for graph balancing with 
the hxed parameter being only the maximnm degree of the graph. Indeed, by the hardness 
proof of |EKS08j . the problem is hard to approximate within a factor less than | even on 
bonnded degree graphs, i.e., when the maximnm degree is some constant. The qnestion 
whether we can End an FPT algorithm for graph balancing with the treewidth being the 
only parameter remains open. 

Reoptimization in scheduling We list some of the questions arising from our results. 
“Can we obtain reapproximation algorithm with the same performance guarantees for gen¬ 
eral instances in the uniform machines model?” “Can we extend our results to arbitrary 
transition costs?”. Finally, “Can we improve the running times of our reapproximation 
schemes, by adjusting the known EPTASs for identical and uniform machines to the reop¬ 
timization model?” 
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